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Alerting Abstract US Al 

NOVELTY - The registers and hold latches in the multi- threading register file, store the data representing two 
instruction threads. A thread control unit (216) provides a thread control signal to the hold latches and the registers 
for selecting a thread. 

DESCRIPTION - An INDEPENDENT CLAIM is included for data processing system. 
USE - For symmetric multiprocessor (SMP) system used in computer. 

ADVANTAGE - The cost of implementing the multi-threading processor is reduced. The use of processor's 
resource is maintained by switching between two types of multi-threading, thus when long latency occurs in one 
thread, the execution of the other thread is not slowed down. 

DESCRIPTION OF DRAWINGS - The figure shows the block diagram of the reduced instruction set chip (RISC) 
processor. 

2 1 6 Thread control unit 
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Multi-threading processor used in computer, has thread control unit which provides thread control signal to 
hold latches and registers for selecting specified thread Original Titles:Mixed-mode hardware multithreading 
Alerting Abstract ...NOVELTY - The registers and hold latches in the multi-threading register file, store the 
data representing two instruction threads. A thread control unit (216) provides a thread control signal to the hold 
latches and the registers for selecting a thread. ...ADVANTAGE - The cost of implementing the multi-threading 
processor is reduced. The use of processor's resource is maintained by switching between two types of 
multi-threading, thus when long latency occurs in one thread, the execution of the other thread is not slowed 
down... Title Terms .../Index Terms/Additional Words: LATCH; REGISTER; Class Codes International Patent 
Classification IPC Class Level Scope Position Status Version Date G06F-009/00 Main Manual Codes (EPI/S-X); 

T01-F02C T01-F03B T01-M02C Original Publication Data by AuthorityOriginal Abstracts: A 

mixed-mode multithreading processor is provided. In one embodiment, the multi-mode multithreading processor 
includes a multithreaded register file with a plurality of registers, a thread control unit, and a plurality of hold 
latches. Each of the hold latches and registers stores data representing a first instruction thread and a second 
instruction thread. The thread control unit provides thread control signals to each of the hold latches and registers 
selecting a thread using the data. The thread control unit provides control signals for interleaving multithreading 
except when a long latency operation is detected in one of the threads. During a predetermined period corresponding 
approximately to the duration of the long latency operation, the thread control unit places the processor in a mode in 
which only instructions corresponding to the other thread are read out of the hold latches and registers. Once the 
predetermined period of time has expired, the processor returns to interleaving multithreading. Claims: What is 
claimed is: 1. A multithreading processor, comprising: a thread control unit; a multithreaded register file having 
a plurality of registers; and a plurality of hold latches, wherein each of a plurality of the registers in the 
multithreaded register file and each of a plurality of the hold latches stores data representing a first instruction 
thread and a second instruction thread; and the thread control unit provides a thread control signal to said hold 
latches and registers selecting a thread using said data. 
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Alerting Abstract WO A2 

NOVELTY - Data in a data path is synchronized by a pipeline register that has a driver and a number of switched 
storage elements (110) coupled to the driver. The driver drives storage elements selected by a multiple-bit selection 
switch. 

DESCRIPTION - An INDEPENDENT CLAIM is included for: 



1 . a method of operating a pipeline register 

2. an integrated circuit device for synchronization of data in a data path 
USE - For data processing applications. 



ADVANTAGE - The system reduces the amount of additional resources for storing inactive states and switching 
between states. 

DESCRIPTION OF DRAWINGS - The drawing shows a schematic block diagram of a pipeline for a processor that 
utilizes multiple-bit register flip-flop. 
1 10 storage element 
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Data synchronization processing element for data processing applications includes pipeline register with 

multiple-bit selection switch Original Titles:High speed multiple-bit flip-Hop HIGH SPEED 

MULTIPLE-BIT FLIP-FLOP Alerting Abstract ...NOVELTY - Data in a data path is synchronized by a pipeline 
register that has a driver and a number of switched storage elements (110) coupled to the driver. The driver drives 

storage elements selected by a multiple a method of operating a pipeline register an integrated circuit device for 

synchronization of data in a data path DESCRIPTION OF DRAWINGS - The drawing shows a schematic block 

diagram of a pipeline for a processor that utilizes multiple-bit register flip-flop.Title Terms .../Index 
Terms/Additional Words: REGISTER; Class Codes International Patent Classification IPC Class Level Scope 
Position Status Version Date G06F-0009/38... G06F-0009/38... Original Publication Data by AuthorityOriginal 
Abstracts:A vertical multi-threading processor includes one or more execution pipelines that are formed from a 
plurality of multiple-bit pipeline register flip-flops. The multiple-bit pipeline register flip-flops supply multiple 
storage bits. The individual bits of a multiple-bit pipeline register flip-flop store data for one of respective multiple 
threads or processes. When an executing (first) process stalls due to a stall condition, for example a cache miss, an 
active bit of the multiple-bit register flip-flop is stalled, removed from activity on the pipeline, and a previously • 
inactive bit becomes active for executing a previously inactive (second) process. All states of the stalled first process 

are preserved in a temporarily inactive bit of the individual multiple-bit register flip-flop in each pipeline stage 

A vertical multi-threading processor (100) includes one or more execution pipelines (200) that are formed from a 
plurality of multiple-bit pipeline register flip-flops (400). The multiple-bit pipeline register flip-flops supply 
multiple storage bits. The individual bits of a multiple-bit pipeline register flip-flop store data for one of respective 
multiple threads or processes. When an executing (first) process stalls due to a stall condition, for example a cache 
miss, an active bit of the multiple-bit register flip-flop is stalled, removed from activity on the pipeline, and a 
previously inactive bit becomes active for executing a previously inactive (second) process. All states of the stalled 
first process are preserved in a temporarily inactive bit of the individual multiple-bit register flip-flop in each 
pipeline stage... Claims:A pipeline register for synchronizing data in a data path comprising:a driver;a plurality of 



switched storage elements coupled to the driver, the driver for driving a... 
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multi-thread execution pathways, so that pathways are converged into single-pathway including non-stalling 
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Alerting Abstract WO A2 

NOVELTY - The processor uses a multi-thread execution pipeline with execution pathways allocated to respective 
execution threads. A non-stalling component is coupled to multi-thread execution pathways, so that the pathways 
are converged into single-pathway including the non-stalling component. 

DESCRIPTION - The non-stalling component is selected from caches, translation look-aside buffers (TLBs), load 
buffer asynchronous interfaces and external memory management unit (MMU) interface. A thread tagging logic 
coupled to the non-stalling component, sets a thread identifier (TID) tag identifying threads in non-stalling 
component. The non-stalling component shared between several threads, maintains compatibility among threads by 
physical duplication of structures and by verifying the communication status after transfer of thread. An 
INDEPENDENT CLAIM is also included for operating method of multi-thread processor. 
USE - Multi-thread processor for commercial computer applications including embedded, desktop and server 
applications, and for handling operations such as OLTP, DSS, data mining, financial forecasting, mechanical and 
electronic computer-aided design (MCAD/ECAD). And also for web servers, data servers, etc. 
ADVANTAGE - Reduces wasted cycle time resulting from stalling and idling, and increases the proportion of 
execution time, by supporting and implementing both vertical and horizontal multithreadings. Increases system 
parallelism by forming several processor cores in a single die. Advances in on-chip multiprocessor horizontal 
threading, are realized, as the processor core sizes are reduced by technological advancements. Vertical 
multi-threading overcomes or hides cache miss stalls, thereby improves performance in commercial multiprocessor 
and multi-threading applications. 

DESCRIPTION OF DRAWINGS - The figure shows the schematic perspective diagram of multi-dimensional 
register file. 
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Multi-thread processor for commercial computer applications, couples non-stalling component with 



multi-thread execution pathways, so that pathways are converged into single-pathway including non-stalling 
component Original Titles:M ultiple-thread processor with single-thread interface shared among threads... 

...Multiple-thread processor with in-pipeline, thread selectable storage Multiple-thread processor with 

single-thread interface shared among threads Multiple-thread processor with single-thread interface shared 

among threads Multiple-thread processor with in-pipeline, thread selectable storage... 

...MULTIPLE-THREAD PROCESSOR WITH SINGLE-THREAD INTERFACE SHARED AMONG THREADS 
Alerting Abstract ...NOVELTY - The processor uses a multi-thread execution pipeline with execution pathways 
allocated to respective execution threads. A non-stalling component is coupled to multi-thread execution pathways, 
so that the pathways are converged into single-pathway including the non-stalling component, ...logic coupled to the 
non-stalling component, sets a thread identifier (TID) tag identifying threads in non-stalling component. The 
non-stalling component shared between several threads, maintains compatibility among threads by physical 
duplication of structures and by verifying the communication status after transfer of thread. An INDEPENDENT 

CLAIM is also included for operating method of multi-thread processor USE - Multi-thread processor for 

commercial computer applications including embedded, desktop and server applications, and for handling 

operations such as OLTP, DSS, data mining, financial forecasting, mechanical and ADVANTAGE - Reduces 

wasted cycle time resulting from stalling and idling, and increases the proportion of execution time, by supporting 
and implementing both vertical and horizontal multi-threadings. Increases system parallelism by forming several 
processor cores in a single die. Advances in on-chip multiprocessor horizontal threading, are realized, as the 
processor core sizes are reduced by technological advancements. Vertical multi-threading overcomes or hides 
cache miss stalls, thereby improves performance in commercial multiprocessor and multi-threading applications... 
...DESCRIPTION OF DRAWINGS - The figure shows the schematic perspective diagram of multi-dimensional 
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increases the proportion of execution time, by supporting and implementing both vertical multithreading and 
horizontal multithreading. Vertical multithreading permits overlapping or "hiding" of cache miss wait times. In 
vertical multithreading, multiple hardware threads share the same processor pipeline. A hardware thread is 
typically a process, a lightweight process, a native thread, or the like in an operating system that supports 
multithreading. Horizontal multithreading increases parallelism within the processor circuit structure, for example 

within a single integrated circuit die that makes up a single-chip processor. To further increase A processor 

reduces wasted cycle time resulting from stalling and idling, and increases the proportion of execution time, by 
supporting and implementing both vertical multithreading and horizontal multithreading. Vertical 
multithreading permits overlapping or "hiding" of cache miss wait times. In vertical multithreading, multiple 
hardware threads share the same processor pipeline. A hardware thread is typically a process, a lightweight process, 
a native thread, or the like in an operating system that supports multithreading. Horizontal multithreading 
increases parallelism within the processor circuit structure, for example within a single integrated circuit die that 
makes up a single-chip processor. To further increase... Claims: What is claimed is:l. A processor comprising:a 
multiple-thread execution pipeline including a plurality of functional units allocated to an execution thread of a 

plurality of execution threads; anda single-thread interface coupled What is claimed is:l. A vertically 

multi-threaded processor comprisingra register file structure replicated to simultaneously represent register state 
for at least two threads; andat least one pipeline sharable amongst the two threads and coupled to the register file 
structure, the sharable pipeline including replicated thread selectable storage elements defined therein, whereby the 
processor freezes in the pipeline, for later resumption, active state... ... What is claimed is: 1 . A processor 

comprising:a multiple-thread execution pipeline including a plurality of functional units allocated to an execution 

thread of a plurality of execution threads; anda single-thread interface coupled thread interface being shared 

among threads and maintaining thread compatibility by physical duplication of structures and by verifying 



communication status after thread transfer, wherein:the multiple-thread execution pipeline includes a plurality of 
pulse-based high-speed multiple-bits flip-flops, the pulse-based high-speed multiple-bits flip-flops having a latch 
structure coupled to a plurality of select-bus lines, the select-bus lines selecting data in the pulsed-based high-speed 

multiple-bits flip-flops corresponding to an active thread from among the plurality of execution threads What is 

claimed is: 1 . A processor comprising:a multiple-thread execution pipeline including a plurality of pipelines 
respectively allocated to a plurality of execution threads, respective ones of the plurality of pipelines to execute the... 
... of the plurality of pipelines is to execute more than one of the plurality of execution threads in a second thread 
dimension, and wherein the multiple-thread execution pipeline includes storage elements for holding the plurality 
of threads;a plurality of shared components coupled to the multiple-thread execution pipeline, the shared 
components being coupled in a sequence so that the plurality of pipelines converge into the sequence of shared 
components, the shared components being logic components that control but do not hold threadsja cache control 
unit coupled to the multiple-thread execution pipeline;an LI cache coupled to the cache control unit; 

andanti-aliasing logic coupled to the LI cache so that the LI cache What is claimed is: 1 . A vertically 

multi-threaded processor comprising: a register file structure replicated to simultaneously represent register state 
for at least two threads; andat least one pipeline sharable amongst the two threads and coupled to the register file 
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Alerting Abstract WO A2 

NOVELTY - IC is equipped with processing unit having machine state logic (310,312) including multiple shadow 
machine states respectively allocated to multiple execution threads. Execution pipeline (314) is coupled to machine 
state logic and is shared among execution threads by vertical threading. Load/store units (316,318) are coupled to 
the execution pipe line and are shared among the execution threads. 

USE - For database handling operations such as OLTP, DSS, data mining, financial forecasting, mechanical and 
electronic computer aided design, webservers and data servers. 

ADVANTAGE - Provides improved multithreading circuits and operating methods that are economical in 
resources and avoid costly overhead which reduces processor performance. Allows processor to enter and exit the 
exception handler immediately without waiting to drain the pipeline or queues and without the inherent timing 
penalty of the operating system's software saving and restoring of registers. 

DESCRIPTION OF DRAWINGS - The figure shows the schematic block diagram depicting a design configuration 
for a single vertically threaded processor. 
3 1 0,3 1 2 Machine state logic 
3 1 4 Execution pipe line 
3 1 6,3 1 8 Load/store units 
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Multiple thread processor has execution pipe line coupled to machine state logic of processing unit and shared 
among multiple execution threads by vertical threading operation ...Original TitlesrPROCESSOR WITH 



MULTIPLE-THREAD, VERTICALLY-THREADED PIPELINE PROCESSOR WITH 

MULTIPLE-THREAD, VERTICALLY-THREADED PIPELINE AND OPERATING METHOD THEREOF... 
...PROCESSEUR DOTE D f UN PIPELINE MULTITHREAD, A THREADS VERTICAUX ET SA METHODE 

OPERATOIRE Processor with multiple-thread, vertically-threaded pipeline PROCESSOR WITH 

MULTIPLE-THREAD, VERTICALLY-THREADED PIPELINE... Alerting Abstract ...NOVELTY - IC is 
equipped with processing unit having machine state logic (310,312) including multiple shadow machine states 
respectively allocated to multiple execution threads. Execution pipeline (314) is coupled to machine state logic and 

is shared among execution threads by vertical threading. Load/store units (316,318) are coupled ADVANTAGE 

- Provides improved multithreading circuits and operating methods that are economical in resources and avoid 
costly overhead which reduces processor performance. Allows processor to enter and exit the exception... 
...immediately without waiting to drain the pipeline or queues and without the inherent timing penalty of the 
operating system's software saving and restoring of registers. Class Codes International Patent Classification IPC 
Class Level Scope Position Status Version Date G06F-009/00 Main Manual Codes (EPI/S-X): T01-J05A1... 
...T01-J05B4M Original Publication Data by AuthorityOriginal AbstractsrA processor (300) reduces wasted cycle 
time resulting from stalling and idling, and increases the proportion of execution time, by supporting and 
implementing both vertical multithreading and horizontal multithreading. Vertical multithreading permits 
overlapping or "hiding" of cache miss wait times. In vertical multithreading, multiple hardware threads share 
the same processor pipeline (314). A hardware thread is typically a process, a lightweight process, a native thread, 
or the like in an operating system that supports multithreading. Horizontal multithreading increases parallelism 
within the processor circuit structure, for example within a single integrated circuit die that makes up a single-chip 

processor. To further increase system parallelism in some A processor reduces wasted cycle time resulting from 

stalling and idling, and increases the proportion of execution time, by supporting and implementing both vertical 
multithreading and horizontal multithreading. Vertical multithreading permits overlapping or "hiding" of cache 
miss wait times. In vertical multithreading, multiple hardware threads share the same processor pipeline. A 
hardware thread is typically a process, a lightweight process, a native thread, or the like in an operating system 
that supports multithreading. Horizontal multithreading increases parallelism within the processor circuit 
structure, for example within a single integrated circuit die that makes up a single-chip processor. To further 

increase system parallelism in some processor embodiments, multiple processor cores A processor (300) 

reduces wasted cycle time resulting from stalling and idling, and increases the proportion of execution time, by 
supporting and implementing both vertical multithreading and horizontal multithreading. Vertical 
multithreading permits overlapping or "hiding" of cache miss wait times. In vertical multithreading, multiple 
hardware threads share the same processor pipeline (314). A hardware thread is typically a process, a lightweight 
process, a native thread, or the like in an operating system that supports multithreading. Horizontal multithreading 
increases parallelism within the processor circuit structure, for example within a single integrated circuit die that 
makes up a single-chip processor. To further increase system parallelism in some processor embodiments, multiple 
processor cores are formed in a single die.Advances in on-chip multiprocessor horizontal threading are gained as 

processor core sizes are reduced through technological advancements CIaims:Prozessor-Pipeline (314), die in 

der Lage ist, gleichzeitig eine Vielzahl von Ausfuhrungsthreads einschliesslich einem oder mehrerer Schattenthreads 
unter Verwendung von Pipelineregistern aus Multi-Bit Flip-Flops (400) zu halten, wobei immer nur einer aus der 
Vielzahl der Ausfuhrungsthreads gleichzeitig aktiv ausgefuhrt wird; undeine Threadsteuerlogik (310, 312), die mit 

der gemeinsamen Prozessor-Pipeline verbunden ist comprising:a shared processor pipeline (314) that is capable 

of concurrently holding a plurality of execution threads including one or more shadow threads using pipeline 
registers of multi-bit flip-flops (400), one of the plurality of execution threads being actively executed at one time; 
anda thread control logic (310, 312) coupled to the shared processor pipeline that is capable of controlling the shared 

processor pipeline to select a de la possession simultanee d'une pluralite de threads d'execution, incluant un ou 

plusieurs threads ombre, utilisant des registres de canaux de donnees de flip-flops multi-bits (400), Tune de la 
pluralite de threads d'execution etant executee activement une fois: etune logique de commande pour les threads 



(310, 312) couplee au canal de donnees du processeur partage qui est capable de commander au canal de donnees du 

processeur partage de selectionner un 1. A processor comprising:a shared processor pipeline that includes 

therein a plurality of multiple-bit flip-flops, each multiple-bit flip-flop capable of concurrently holding in the shared 
processor pipeline, at least a portion of thread state for a plurality of execution treads, one of the execution threads 
being actively executed in the shared processor pipeline at a given time; andthread control logic coupled to the 
shared processor pipeline and cap able of controlling the shared processor pipeline to select thread state for an active 
one of the execution threads, including the portion of the thread state represented in the multiple-bit flip-flops of the 
shared processor pipeline. 
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Multi-thread processor for commercial computer applications, has multi-threaded processor core converted 
from single thread processor core, which maintains terminal connections of single thread processor core 
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Alerting Abstract WO A2 

NOVELTY - The processor includes a multi-threaded processor core which is converted from a single-thread 
processor core, using multi-bit, thread selectable flip-flops. The multithreaded processor core having an aspect 
ratio, maintains terminal connections of single-thread processor core. 

DESCRIPTION - The multi-threaded pipeline in the multi -threaded processor core, includes several multi-bit, 
thread-selectable master-slave flip-flops which globally replace the single bit master slave flip-flops of single 
thread processor core, but maintains the same footprint as the single bit master slave flip flops. An INDEPENDENT 
CLAIM is also included for retrofitting method of single-thread processor with multi-thread processor. 
USE - Multi-thread processor for commercial computer applications, and for handling operations such as OLTP, 
TSS, data mining, financial forecasting, mechanical and electronic computer-aided design, (MC AD/EC AD). And 
also for web servers, data servers, etc. 

ADVANTAGE - Reduces wasted cycle time resulting from stalling and idling, and increases the proportion of 
execution time, by supporting and implementing both vertical and horizontal multi threadings. Increases system 
parallelism by forming several processor cores in single die. Advances in on-chip multiprocessor horizontal 
threading, are realized, as the processor core sizes are reduced by technology advancements. During designing, 
vertical and horizontal multithreadings are achieved with minimal retrofitting of existing processor core, thereby 
reducing logic and physical design changes and avoiding global chip re-routing, recomposing, and the expense of 



heavy redesigning of integrated circuits. Vertical multithreading provides advantageous effects in sequential and 
parallel processing applications with frequent context switches, reliably. 

DESCRIPTION OF DRAWINGS - The figure shows schematic perspective diagram of multi-dimensional register 
file. 

Title Terms /Index Terms/Additional Words: MULTI; THREAD; PROCESSOR; COMMERCIAL; COMPUTER; 
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Multi-thread processor for commercial computer applications, has multi-threaded processor core converted 
from single thread processor core, which maintains terminal connections of single thread processor core 
Original TitIes:MULTI-THREADED PROCESSOR BY MULTIPLE-BIT FLIP- FLOP GLOBAL 

SUBSTITUTION MULTI-THREADED PROCESSOR BY MULTIPLE-BIT FLIP- FLOP GLOBAL 

SUBSTITUTION Alerting Abstract ...NOVELTY - The processor includes a multi-threaded processor core which 
is converted from a single-thread processor core, using multi-bit, thread selectable flip-flops. The multithreaded 
processor core having an aspect ratio, maintains terminal connections of single-thread processor core. 
DESCRIPTION - The multi-threaded pipeline in the multi -threaded processor core, includes several multi-bit, 
thread-selectable master-slave flip-flops which globally replace the single bit master slave flip-flops of single 
thread processor core, but maintains the same footprint as the single bit master slave flip flops. An INDEPENDENT 

CLAIM is also included for retrofitting method of single-thread processor with multi-thread processor USE - 

Multi-thread processor for commercial computer applications, and for handling operations such as OLTP, TSS, 
data mining, financial forecasting, mechanical and electronic computer-aided design, (MC AD/EC AD... 
...ADVANTAGE - Reduces wasted cycle time resulting from stalling and idling, and increases the proportion of 
execution time, by supporting and implementing both vertical and horizontal multi threadings. Increases system 
parallelism by forming several processor cores in single die. Advances in on-chip multiprocessor horizontal 
threading, are realized, as the processor core sizes are reduced by technology advancements. During designing, 
vertical and horizontal multithreadings are achieved with minimal retrofitting of existing processor core, thereby 
reducing logic and physical design changes and avoiding global chip re-routing, recomposing, and the expense of 
heavy redesigning of integrated circuits. Vertical multithreading provides advantageous effects in sequential and 

parallel processing applications with frequent context switches, reliably DESCRIPTION OF DRAWINGS - The 

figure shows schematic perspective diagram of multi-dimensional register file. Class Codes International Patent 

Classification IPC Class Level Scope Position Status Version Date G06F-009/00 G06F-009/30 Main Manual 

Codes (EPI/S-X): T01-F02C T01-H03C T01-H05B1 T01-M02C2 Original Publication Data by 

AuthorityOriginal Abstracts:A processor improves throughput efficiency and exploits increased parallelism by 



introducing multithreading to an existing and mature processor core. The multithreading is implemented in two 
steps including vertical multithreading and horizontal multithreading. The processor core is retrofitted to support 
multiple machine states. System embodiments that exploit retrofitting of an existing processor core advantageously 
leverage hundreds of man-years of hardware and software development by extending the lifetime of a proven 
processor pipeline generation. A processor implements N-bit flip-flop global substitution. To implement multiple 
machine states, the processor converts 1-bit flip-flops in storage cells of the stalling vertical thread to an N-bit 

global flip-flop where N is the number of vertical threads A processor improves throughput efficiency and 

exploits increased parallelism by introducing multithreading to an existing and mature processor core (300). The 
multithreading is implemented in two steps including vertical multithreading and horizontal multithreading. The 
processor core is retrofitted to support multiple machine states. System embodiments that exploit retrofitting of an 
existing processor core advantageously leverage hundreds of man-years of hardware and software development by 
extending the lifetime of a proven processor pipeline generation. A processor implements N-bit flip-flop global 
substitution. To implement multiple machine states, the processor converts 1-bit flip-flops in storage cells of the 
stalling vertical thread to an N-bit global flip-flop where N is the number of vertical threads. Claims: What is 
claimed is:l. A processor comprising:a multiple-threaded processor core converted from a single-thread processor 
core and maintaining area, aspect ratio, and terminal connections of the single-thread processor core. 
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Multiple thread processor has thread switch logic for switching execution threads according to thread 
switching mode selected from multiple thread switching modes 
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Alerting Abstract WO A2 

NOVELTY - Multiple thread execution pipeline has multiple pipelines allocated to multiple execution threads. 
Thread switch logic coupled to the multiple thread execution pipeline, switches execution threads according to 
thread switching mode selected from multiple thread switching modes. One thread switching mode is an oblivious 
mode switching threads for every N selected number of cycles. 

DESCRIPTION - An INDEPENDENT CLAIM is also included for method of operating processor. 

USE - For database handling operations such as OLTP, DSS, data mining, financial forecasting, mechanical and 

electronic computer aided design, web servers, data servers. 

ADVANTAGE - Provides improved multi-threading circuits and operating methods that are economical in 
resources and avoids costly overhead which reduces processor performance. Allows the processor to enter and exit 
the exception handler immediately without waiting to drain the pipeline or queues and without inherent timing 
penalty of the operating system's software saving and restoring of registers. 

DESCRIPTION OF DRAWINGS - The figure shows the schematic block diagram of anti-aliasing logic for usage in 
various processor implementations including cache. 

Title Terms /Index Terms/Additional Words: MULTIPLE; THREAD; PROCESSOR; SWITCH; LOGIC; 
EXECUTE; ACCORD; MODE; SELECT 
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Multiple thread processor has thread switch logic for switching execution threads according to thread 
switching mode selected from multiple thread switching modes Original 

TitlestTHREADUMSCHALTUNGSLOGIK IN EINEM MULTITHREADPROZESSOR THREAD SWITCH 

LOGIC IN A MULTIPLE-THREAD PROCESSOR THREADUMSCHALTUNGSLOGIK IN EINEM 

MULTITHREADPROZESSOR THREAD SWITCH LOGIC IN A MULTIPLE-THREAD PROCESSOR... 

...Thread switch logic in a multiple-thread processor THREAD SWITCH LOGIC IN A 

MULTIPLE-THREAD PROCESSOR Alerting Abstract ...NOVELTY - Multiple thread execution pipeline has 



multiple pipelines allocated to multiple execution threads. Thread switch logic coupled to the multiple thread 
execution pipeline, switches execution threads according to thread switching mode selected from multiple thread 
switching modes. One thread switching mode is an oblivious mode switching threads for every N selected number of 
cycles. ...ADVANTAGE - Provides improved multi-threading circuits and operating methods that are economical 
in resources and avoids costly overhead which reduces processor performance. Allows the processor to enter and 

exit the handler immediately without waiting to drain the pipeline or queues and without inherent timing penalty 

of the operating system's software saving and restoring of registers. Class Codes International Patent Classification 

IPC Class Level Scope Position Status Version Date G06F-009/00 G06F-009/30 Main G06F-009/38... 

...G06F-009/48 G06F-009/52 Manual Codes (EPI/S-X): T01-F03B1 T01-M02C2 Original Publication 

Data by Authority.. .Original Abstracts:that performs a fast thread-switching operation in response to an LI cache 
miss stall. The fast thread-switching operation implements one or more of several thread-switching methods. A 
first thread-switching operation is "oblivious" thread-switching for every N cycle in which the individual flip-flops 
locally determine a thread-switch without notification of stalling. The oblivious technique avoids usage of an extra 

global interconnection between threads for thread selection. A second thread-switching operation as fast as 

possible, similar to a clock tree distribution. In some systems, a processor derives a thread select signal that is 

applied to the flip-flops by overloading a scan enable (SE) signal of a scannable flip-flop logic that performs a 

fast thread-switching operation in response to an Llcache miss stall. The fast thread-switching operation implements 
one or more of several thread-switching methods. A first thread-switching operation is "oblivious" 
thread-switching for every N cycle in which the individual flip-flops locally determine a thread-switch without 
notification of stalling. The oblivious technique avoids usage of an extra global interconnection between threads for 

thread selection. A second thread-switching operation is "semi-oblivious" thread-switching as fast as possible, 

similar to a clock tree distribution. In some systems, a processor derives a thread select signal that is applied to the 

flip-flops by overloading a scan enable (SE) signal of a scannable flip-flop that performs a fast 

thread-switching operation in response to an LI cache miss stall. The fast thread-switching operation implements 
one or more of several thread-switching methods. A first thread-switching operation is "oblivious" 
thread-switching for every N cycle in which the individual flip-flops locally determine a thread-switch without 
notification of stalling. The oblivious technique avoids usage of an extra global interconnection between threads 
for thread selection. A second thread-switching operation is "semi-oblivious" thread-switching for use with an 

existing "pipeline stall" signal (if any as fast as possible, similar to a clock tree distribution. In some systems, a 

processor derives a thread select signal that is applied to the flip-flops by overloading a scan enable (SE) signal of a 
scannable flip-flop. ClaimsrProzessor, umfassend:eine Multithread- Executionpipeline (314) mit einer Vielzahl 
von Pipelines, die jeweils einer Vielzahl von Executionthreads (122, 124, 126 und 128) zugeordnet sind; undeine 
Threadumschaltlogik (610), die mit der Multithread-Executionpipeline gekoppelt ist, wobei die 
Threadumschaltlogik die Executionthreads nach einer Thread-Umschaltbetriebsweise umschaltet, die aus einer 
Vielzahl von Thread-Umschaltbetriebsweisen ausgewahlt wird, wobei die Multithreadstruktur gleichzeitig Daten 
von mehreren Threads unter Verwendung von Multiple-Bit-Flip-Flops reprasentiert.A processor comprising;a 
multiple-thread execution pipeline (314) including a plurality of pipelines respectively allocated to a plurality of 
execution threads (122, 124, 126, and 128); anda thread switch logic (610) coupled to the multiple-thread 
execution pipeline, the thread switch logic switching execution threads according to a thread switching mode 
selected from among a plurality of thread switching modes wherein a multiple-threaded structure concurrently 

represents data for multiple threads using multiple-bit flip-flops modes de commutation de processus 

elementaires dans lesquels une structure a processus elementaires multiples represente simultanement des donnees 
pour des processus elementaires multiples utilisant des flip-flop a bits multiples.A processor comprising:a 
multiple-thread execution pipeline including a plurality of pipelines respectively allocated to a plurality of 
execution threads; anda thread switch logic coupled to the multiple-thread execution pipeline, the thread switch 
logic that switches execution threads according to a thread switching mode selected from among a plurality of 
thread switching modes. 
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Integrated circuit for computer system, has register file bit comprising context switch mechanism causing 
data on data output of primary latch to be written to secondary latch and vice versa in single clock cycle 
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Latched synchronous output circuit for e.g. cache SRAM for pentium - has latch and parallel slave latch with 
incoming data inverted by swapping inputs from sense amplifier and sensing data on global bar 
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Alerting Abstract US A 

The synchronous output buffer device includes a data input terminal, three clock controlled transfer gates, and two 
latch circuits. The first gate (68) is enabled on one phase of a clock cycle and disabled on another. The second gate 
is coupled to the output of the first latch (75) and has its output coupled to an input terminal of an output driver 
circuit (95). The third gate is coupled between the input of the first latch and an inverted output of the first latch. 
The third gate is disabled on the first phase of the clock and enabled on the second to maintain the input data 
received on the first phase of the clock during the second phase of the clock when the first transfer gate is disabled 
and to accept new input data on the first phase of the clock when the first transfer gate is disabled. 
USE/ADVANTAGE - Also for DRAM, EEPROM and flash EEPROM. High speed output buffer due to clocked 
parallel operation. Slave latch holds input data while additional data is presented for input. 

Title Terms /Index Terms/Additional Words: LATCH; SYNCHRONOUS; OUTPUT; CIRCUIT; CACHE; SRAM; 
PARALLEL; SLAVE; INCOMING; DATA; INVERT; INPUT; SENSE; AMPLIFY ; GLOBE; BAR 
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Latched synchronous output circuit for e.g. cache SRAM for pentium has latch and parallel slave latch 

with incoming data inverted by swapping inputs from sense amplifier and sensing data on global bar Alerting 
Abstract ...The synchronous output buffer device includes a data input terminal, three clock controlled transfer 
gates, and two latch circuits. The first gate (68) is enabled on one phase of a clock cycle and disabled on another. 
The second gate is coupled to the output of the first latch (75) and has its output coupled to an input terminal of an 
output driver circuit (95). The third gate is coupled between the input of the first latch and an inverted output of the 

first latch USE/ADVANTAGE - Also for DRAM, EEPROM and flash EEPROM. High speed output buffer 

due to clocked parallel operation. Slave latch holds input data while additional data is presented for input. Title 
Terms /Index Terms/Additional Words: LATCH; Original Publication Data by AuthorityOriginal AbstractsrThe 
time required to output data from an output buffer is significantly reduced by having a slave latch in a parallel 
connection with a master latch. Incoming data is stored in a master latch on a first phase of a clock pulse. On the 
second phase of the clock pulse, the data is output of the master latch and provided to an output driver. A slave 
latch is coupled to the input node of the output driver. On the subsequent phase of the clock, the slave latch is 
switched on to hold the state of the input to the output driver constant. The slave latch thus receives the output of 
the master register in parallel with the output driver and also performs its function of maintaining the input to the 
output buffer for one entire clock pulse while new data is being presented to the master latch. Data is thus 
provided more quickly to the output driver than was previously possible with prior art master/slave configurations. 
...Claims:data through the transfer gate on a first phase of a clock cycle and disabled on a second phase of the clock 
cycle;a first latch circuit coupled to the output of the first transfer gate and receiving the data at the input terminal 
when the first transfer gate is enabled; anda second clock controlled transfer gate coupled to the output of the first 
latch circuit and having its output coupled to an input terminal of an output driver circuit to provide data to the 
driver circuit from the first latch circuit. 



33/5.K/3 (Item 3 from file: 350) Links 
Derwent WPIX 

(c) 2007 The Thomson Corporation. All rights reserved. 

0015499508 Drawing available 

WPI Acc no: 2006-063646/200607 

Related WPI Acc No: 1999-580144; 2000-012972 

XRPX Acc No: N2006-055189 

Interrupt processing and task switching circuitry for use in computer architectures, has task switching circuit 
coupled to zero overhead multiplexing circuits for storing later task while simultaneously switching 
pre-stored earlier task 
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Alerting Abstract US B 1 

NOVELTY - A task enable circuit determines whether a predetermined task is ready for execution by CPU, from 
predetermined inputs. A task priority selection circuit is coupled to output of task enable circuit, for determining an 
order for running of tasks that are ready. A task switching circuit is coupled to output of selection circuit and zero 
overhead multiplexing circuit for storing laster task in set of latches while simultaneously switching pre-stored 
earlier task. 

USE - Interrupt processing and task switching circuitry for use in computer architectures. 
ADVANTAGE - Prevents loss of time by eliminating the need to transfer the running task's data to separate task 
state storage element at the instant of interrupt and task change event. Permits the use of portion of system's task 
memory for effective tracing of processor flow for full speed, real-time use in computer programming debugging. 
Allows linking of tasks for mandatory sequential execution of linked tasks as required in multi-tasking systems. 
Eliminates the need for extensive storage buffers and auxiliary specialized processors with their associated costs and 
delays. Eliminates the latency involved in long multicycle uninterruptable instructions and latency unpredictability. 
DESCRIPTION OF DRAWINGS - The figure shows the block diagram of the interrupt processing and task 
switching circuitry. 
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...are ready. A task switching circuit is coupled to output of selection circuit and zero overhead multiplexing circuit 
for storing laster task in set of latches while simultaneously switching pre-stored earlier task. Original Publication 
Data by Authority.. .Original Abstracts: interrupt latency and task change processing overhead delays in computer 
architectures. Without loss of time, the system performs complete task state saving and restoration between one 
cycle and the next without software intervention. For each Central Processing Unit (1) register, the invention uses 
one or more auxiliary latches (3, 4) wherein one latch (3, 4) is used as the "running" latch and one of the auxiliary 
latches is attached to task storage memory. The invention swaps connections between alternate "running" registers 
and auxiliary registers while transferring other tasks to and from task storage memory (2). The invention provides 
a task linking system to allow the linking of tasks for the mandatory sequential execution of the linked tasks. 
Further, the... Claims: 1. In a microprocessor-based computing system having a CPU for executing tasks represented 
by task register sets and further including peripheral devices that issue interrupt commands, an interrupt and task 

change processing circuit comprising:(a) a task enable circuit for determining from predetermined selection 

circuit, wherein the task switching circuit is coupled to a zero overhead multiplexing circuit for storing a later task in 
a first set of latches during a first clock cycle while simultaneously switching a previously stored earlier task stored 
in a second set of latches into a task switch controller during the same clock cycle. 
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Cyclic redundancy check calculation performing circuit for data communication system, has cyclic 
redundancy check calculation blocks to perform calculation to yield value of n-bit check result, and check 
register to latch value 
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Alerting Abstract US A 1 

NOVELTY - The circuit has a set of cyclic redundancy check (CRC) calculation blocks. Each cyclic redundancy 
check calculation block performs a CRC calculation to yield a value of an n-bit CRC result. A switch selectively 
passes the CRC calculation value. A cyclic redundancy check register latches one of the CRC calculation values 
selectively passed by the switch. The latched value is fedback to the calculation blocks. 
DESCRIPTION - An INDEPENDENT CLAIM is also included for a method of calculating a cyclic redundancy 
check value with a variable width data input. 

USE - Used for performing a cyclic redundancy check calculation in a data communication system, and 
implemented as a field programmable gate array. 

ADVANTAGE - The CRC calculation blocks perform the CRC calculation at a short span of time i.e. within one 
clock cycle, thus offering high speed and flexibility. The blocks perform multiple CRC calculations in parallel, 
hence a cumulative CRC is available from the register when required, and is updated every clock cycle. 
DESCRIPTION OF DRAWINGS - The drawing shows a flowchart of a method of calculating a cyclic redundancy 
check value with a variable width data input. 
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...performing circuit for data communication system, has cyclic redundancy check calculation blocks to 
perform calculation to yield value of n-bit check result, and check register to latch value Alerting Abstract 

...CRC) calculation blocks. Each cyclic redundancy check calculation block performs a CRC calculation to yield a 
value of an n-bit CRC result. A switch selectively passes the CRC calculation value. A cyclic redundancy check 
register latches one of the CRC calculation values selectively passed by the switch. The latched value is fedback 
to the calculation blocks. ...ADVANTAGE - The CRC calculation blocks perform the CRC calculation at a short 
span of time i.e. within one clock cycle, thus offering high speed and flexibility. The blocks perform multiple CRC 
calculations in parallel, hence a cumulative CRC is available from the register when required, and is updated every 
clock cycle... Title Terms .../Index Terms/Additional Words: REGISTER; LATCH Original Publication Data by 
Authority...Original Abstracts: Preferably, multiple CRC calculations are performed in parallel, each CRC 
calculation involving a different number of data bits from the data word and terminating within one clock cycle. The 
CRC polynomial is preferably incorporated into the hardware for each CRC calculation. ...Claims:calculation to 
yield a value of an n-bit CRC result; a switch for selectively passing one of the CRC calculation values; and a CRC 
register for latching the one of the CRC calculation values selectively passed by the switch. 



33/5,K/8 (Item 8 from file: 350) Links 
Derwent WPIX 

(c) 2007 The Thomson Corporation. All rights reserved. 

00 1 3799560 Drawing available 
WP1 Acc no: 2003-899626/200382 
XRPX Acc No: N2003-71 8064 

Serial to parallel data converter for high speed serial transmissions, has pre-register that latches parallel data 
and selector that selects data receivers for serial data conversion and parallel data output, respectively 
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Alerting Abstract US Al 

NOVELTY - The converter has two data receivers (10, 20) of n flip- Hops (1 1, 21) for latching n-bit parallel data. 
A pre- register of m flip-flops latch m-bits of the n-bit parallel data and output is connected to the inputs of both 
the receivers. A selector (40) connected to the flip-flops of the receivers, selects a receiver for serial data 
conversion and another for parallel data output. 

DESCRIPTION - The inputs of the latter data receiver are connected in parallel with that of the former. The inputs 
of the pre-register receive the serial data and the m-bits are shifted back to one of the receivers before each of the 
conversion cycle ends. The flip- flops of both the data receivers are of D-type. An INDEPENDENT CLAIM is also 
included for a serial to parallel data conversion method. 
USE - Used in high-speed serial transmissions. 

ADVANTAGE - The converter generates number of phase clocks signals equal to the width of the bits in the 
parallel data, thereby effectively reducing the number of required clocks and enabling operation at high frequency. 
The pre-register does not need switch control so it does not have the setup time problem during parallel data 
switching. 

DESCRIPTION OF DRAWINGS - The drawing shows a latch circuit diagram for serial-to-parallel conversions in 
multi-clock phase architecture. 
10, 20 Data receivers 
11,21 Flip-flops 



40 Selector 
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Serial to parallel data converter for high speed serial transmissions, has pre-register that latches parallel data 
and selector that selects data receivers for serial data conversion and parallel data output, respectively 
Alerting Abstract ...NOVELTY - The converter has two data receivers (10, 20) of n flip -flops (1 1, 21) for 
latching n-bit parallel data. A pre- register of m flip-flops latch m-bits of the n-bit parallel data and output is 
connected to the inputs of both the receivers. A selector (40) connected to the flip-flops of the receivers, selects a 
receiver for serial data conversion and another for parallel data output. DESCRIPTION - The inputs of the latter data 
receiver are connected in parallel with that of the former. The inputs of the pre-register receive the serial data and 
the m-bits are shifted back to one of the receivers before each of the conversion cycle ends. The flip- flops of both 
the data receivers are of D-type. An INDEPENDENT CLAIM is also included for a serial to parallel data conversion 

method the width of the bits in the parallel data, thereby effectively reducing the number of required clocks and 

enabling operation at high frequency. The pre-register does not need switch control so it does not have the setup 

time problem during parallel data switching DESCRIPTION OF DRAWINGS - The drawing shows a latch 

circuit diagram for serial-to-parallel conversions in multi-clock phase architecture... ...11,21 Flip- flops Title 

Terms .../Index Terms/Additional Words: REGISTER; LATCH; Original Publication Data by 
Authority...Original Abstracts: invention can further solve the setup time problem associated with the switching 
one of two parallel data receivers as the parallel data output. A pre-register is employed in the converter of the 
present invention. Since this pre-register does not need switch control, it does not have the setup time problem 

during parallel data switching invention can further solve the setup time problem associated with the switching 

one of two parallel data receivers as the parallel data output. A pre-register is employed in the converter of the 
present invention. Since this pre-register does not need switch control, it does not have the setup time problem 
during parallel data switching. ...ClaimsrA serial-to-parallel data converter for converting a serial data into a n-bit 
parallel data, comprising:a first data receiver, composed of n flip-flops for latching the n-bit parallel data;a second 
data receiver, composed of n flip-flops with inputs connected in parallel with inputs of the first data receiver, for 
latching the n-bit parallel data;a pre-register, composed of m flip-flop(s) for latching the first m bit(s) of the n-bit 
parallel data and output(s) thereof connected to the inputs of flip-flops in the first data receiver and the second 
data receiver corresponding to the first m bit(s); wherein the inputs of the pre-register receive the serial data, and 
the first m bit(s) is shifted back to one of the first data receiver and the second data receiver before each conversion 



cycle ends; anda selector, connected to the flip-flops of the first data receiver and the second data receiver for 
selecting one of the first data receiver and the second data receiver for serial data conversion and the other for 

parallel data output A serial-to-parallel data converter for converting a serial data into a n-bit parallel data, 

comprisingra first data receiver, composed of n flip-flops for latching the n-bit parallel data referring n phase clock 
signals in a conversion cycle;a second data receiver, composed of n flip-flops with inputs connected in parallel 
with inputs of the first data receiver, for latching the n-bit parallel data by referring said n phase clock signals in 
said conversion cycle;a pre-register, composed of m flip-flop(s) for latching the first m bit(s) of the n-bit parallel 
data and output(s) thereof connected to the inputs of flip-flops in the first data receiver and the second data 
receiver corresponding to the first m bit(s); wherein the inputs of the pre-register receive the serial data, and the 
first m bit(s) is shifted back to one of the first data receiver and the second data receiver before each conversion 
cycle ends; anda selector, connected to the flip-flops of the first data receiver and the second data receiver for 
selecting one of the first data receiver and the second data receiver for serial data conversion and the other for 
parallel data output. 
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Memory cell structure for field programmable gate array, has several flip flops with data, clock inputs 
coupled to respective inputs of shift register and data output coupled to single data bit input of memory cells 
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Alerting Abstract US B 1 

NOVELTY - A shift register has a chain of flip-flops, where clock input of each flip flop is coupled to clock input 
of shift register and data input of specific flip-flop is coupled to data input of shift register. Data output of each 
flip-flop is coupled to single data bit input of a selective data path forming unit of memory cells. 
USE - For storing programming bits in static RAM-based field programmable gate array (FPGA). 
ADVANTAGE - As the flip-flops of all the memory bytes for logic block are coupled together in shift-register, the 
need for regularity in the array structure is eliminated, such that there is no need to place the logic blocks in the 
interstices of memory array. By changing the ROM in a simple manner, functionality can be change from the one 
power up cycle to another, thus design changes in the FPGA is made easier, cheaper and faster, thereby enhances 
and speeds up design cycles. 

DESCRIPTION OF DRAWINGS - The figure shows a circuit diagram of the SRAM control memory for FPGA. 
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Memory cell structure for field programmable gate array, has several flip flops with data, clock inputs 
coupled to respective inputs of shift register and data output coupled to single data bit input of memory cells 
Alerting Abstract ...NOVELTY - A shift register has a chain of flip- flops, where clock input of each flip flop is 
coupled to clock input of shift register and data input of specific flip-flop is coupled to data input of shift register. 
Data output of each flip-flop is coupled to single data bit input of a selective data path forming unit of memory 
cells. ...ADVANTAGE - As the flip-flops of all the memory bytes for logic block are coupled together in 
shift-register, the need for regularity in the array structure is eliminated, such that there is no need to place the logic 
blocks in the interstices of memory array. By changing the ROM in a simple manner, functionality can be change 
from the one power up cycle to another, thus design changes in the FPGA is made easier, cheaper and faster, thereby 
enhances and speeds up design cycles... Title Terms .../Index Terms/Additional Words: REGISTER; Original 
Publication Data by Authority... Original Abstracts: FPGA eliminates the need for a regular array of word lines and 
bit lines. The memory includes memory bytes, each of which has eight SRAM latches, a single flip-flop and a 
one-of-eight decoder having data input coupled to the inverting output of the flip-flop and eight individual data 
outputs, each of which is coupled to a data input of one of the SRAM latches. The flip-flops of all memory bytes 
for a logic block are coupled together in a serpentine shift register. Loading of configuration data involves shutting 
down all paths through the decoder, shifting all configuration bits for the "0" position SRAM latch of each memory 
byte into the shift register, and setting the address bits to the decoder so as to create a conductive path on each 
memory byte from the output of the flip-flop to the data input of the 0 latch. The process is then repeated for the 
seven other SRAM latch positions. ...Claims: data bit input and one of said plurality of outputs depending on the 
address bits received at said address input; andat least one shift register having a data input and a clock input, for 
receiving at said data input a string of data bits in serial format which are shifted into said shift register in 
accordance with clock signals received at said clock input, said shift register comprising a chain of flip-flops, each 
flip-flop having a clock input, a data input, and at least one data output, wherein the clock input of each flip-flop 
is coupled to the clock input of said shift register; the data input of one flip-flop in the chain is coupled to the data 
input of said shift register; the data input of every flip-flop other than said one is coupled to a data output of the 
previous flip-flop in said chain; and a data output of each flip-flop is coupled to the single data bit input of the 
selective data path forming means of one of said groups of memory cells. 
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Extended mode transfer instruction speed-up apparatus, has selector that selects absolute address from adder 
and main address generator, when control signals are in predetermined state 
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Alerting Abstract US A 

NOVELTY - Selector (2-6) selects any one of the control signals generated corresponding to predetermined transfer 
instructions. A latching unit latches the selected control signal into base register (4-6), When the signal changes 
state. Another selector (8-6) selects absolute value from an adder (6-6) and from main address generator (82A), 
when the control signals are in their respective state. 

DESCRIPTION - A testing unit determines whether current instruction is one of predetermined transfer instructions. 
The first control signal showing state of presence of the transfer instruction is output by an output unit. The base 
address stored in base register (4-6) of current instruction is added to offset provided by current instruction for 
generating absolute address by an adder. If the state of first control signal changes, another control signal having 
state similar to that of first control signal, is generated. 
USE - For general purpose digital data processing system. 

ADVANTAGE - Reduces response and computation time of extended mode transfer instructions in predetermined 
time for certain selected transfer instructions. 

DESCRIPTION OF DRAWINGS - The figure shows the block diagram of added instruction cache generator. 
2-6,8-6 Selectors 
4-6 Base register 
6-6 Adder 

82A Address generator 
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Alerting Abstract ...NOVELTY - Selector (2-6) selects any one of the control signals generated corresponding to 
predetermined transfer instructions. A latching unit latches the selected control signal into base register (4-6). 
When the signal changes state. Another selector (8-6) selects absolute value from an adder (6-6) and from main 

address generator (82A instructions. The first control signal showing state of presence of the transfer instruction 

is output by an output unit. The base address stored in base register (4-6) of current instruction is added to offset 

provided by current instruction for generating absolute address by an adder. If the state of first 4-6 Base register 

Original Publication Data by Authority... Original Abstracts:a fast transfer is possible. The fast transfer process 
requires fewer checks than the previous apparatus which accelerates the response to selected transfer instructions by 
one cycle. ...Claims:first control signal with a second state; c) adder means for generating an absolute first address 
by adding the base address stored in the base register of the current instruction to an offset also provided by the 
current instruction; d) means for generating a second control signal, which changes state, one memory cycle after 
the first control signal changes state, to the same state as the first control signal; e) first selection means for selecting 
a first value when the first control signal is in the first state, and selecting a second value when the first control 
signal is in the second state; f) latching means for latching the first selection means selection into the base register 
whenever the first control signal changes state; g) second selection means for selecting the absolute value from the 
adder means when the second control signal is in the first state and for selecting the absolute value from the main... 
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Zero overhead computer interrupts circuit with task switching used in computer architectures 
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Alerting Abstract WO A 1 

NOVELTY - Multiplexers (14,15) having inputs from the central processing unit (CPU) (1) and task storage 
memory (2) are each connected to auxiliary latches (3,4). Additional multiplexers have inputs from each latch, one 
(13) connecting to an input of the task storage memory and one (17), the "running" latch, connecting to the CPU, 
Connections are exchanged between alternate "running" registers and auxiliary registers while transferring other 



tasks to and from task storage memory. 

USE - The system is used in computer architectures. 

ADVANTAGE - The system allows the mandatory sequential execution of linked tasks and provides a counter to 
increase priorities of various tasks as their deadline approaches. Delays and costs are reduced by eliminating storage 
buffers and specialized processors. A deterministic architecture structure is created. 

DESCRIPTION OF DRAWINGS - The figure shows a circuit diagram of an interrupt and task exchange system. 

1 Central processing unit 

2 Task storage memory 
3,4 Auxiliary latches 
13,14,15,17 Multiplexers 
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Alerting Abstract ...NOVELTY - Multiplexers (14,15) having inputs from the central processing unit (CPU) (1) 
and task storage memory (2) are each connected to auxiliary latches (3,4). Additional multiplexers have inputs from 
each latch, one (13) connecting to an input of the task storage memory and one (17), the "running" latch, connecting 
to the CPU. Connections are exchanged between alternate "running" registers and auxiliary registers while 
transferring other tasks to and from task storage memory. ...3,4 Auxiliary latches Original Publication Data by 
Authority... Original Abstracts: interrupt latency as task change processing overhead delays in computer 
architectures. Without loss of time, the system performs complete task state saving and restoration between one 
cycle and the next without software intervention. For each Central Processing Unit (1) the invention uses one or 
more auxiliary latches (3, 4) wherein one latch (4) is used as the "running" latch and one of the auxiliary latches is 
attached to task storage memory. The invention swaps connections between alternate "running" registers and 



auxiliary registers while transferring other tasks to and from task storage memory (2). The invention provides a task 

linking system to allow the linking of tasks for interrupt latency as task change processing overhead delays in 

computer architectures. Without loss of time, the system performs complete task state saving and restoration 
between one cycle and the next without software intervention. For each Central Processing Unit (1) the invention 
uses one or more auxiliary latches (3, 4) wherein one latch (4) is used as the "running" latch and one of the 
auxiliary latches is attached to task storage memory. The invention swaps connections between alternate "running" 
registers and auxiliary registers while transferring other tasks to and from task storage memory (2). The invention 
provides a task linking system to allow the linking of tasks for... 
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Digital data processor executing conditional instruction within single machine cycle - has two temporary 
registers for holding two pieces of data read from two general registers specified by instruction, to provide 
architecture for pipeline processing 
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Alerting Abstract US A 

The digital data processor includes a CPU, and a memory from which a condition instruction is fetched and passed 
to an instruction register. A first control circuit responds to source-codes to select the general registers having 
respective pieces of data to be compared. A first latch stores instruction data including a code for an operation to be 
performed by an arithmetic and logic unit (ALU) on the two pieces of data under the control of a second control 
circuit. 

A conditional code register stores a conditional code representing the result of the ALU operation, and a second 
latch stores selection criteria for destination registers specified by the instruction. A selection circuit operates under 
the control of a third control circuit to sort the ALU output data of a third control circuit to the specific destination 
register in accordance with the selection criteria and the condition code. 

ADVANTAGE - Allows pipeline processing to be performed without disturbance or processing time penalty, which 
ensures high-speed performance. Instruction execution is completed within a single CPU cycle. 
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Expandable dynamic random access memory - has motherboard with memory control unit which generates 
row and column addresses as function of memory configuration, memory address and read/write command 
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Alerting Abstract EP Al 

The dynamic random access memory (DRAM) includes a motherboard (60). A memory control unit (5) is mounted 
on the motherboard in order to configure the memory and to generate, as a function of the memory configuration, of 
a memory address and of a read/write command received via a system bus (6). A row address (RADDR) and a 
column address (COL.ADDR) are placed at successive time intervals on an address output channel (17). A pair of 
several distinct timed signals (RAS.XY, CAS.XY. X=A,B,C,D,. Y=0,l-5) validate row and column addresses 
respectively, selectively directed to one and only one of the modules. One of several distinct timed write commands 
WE(0-5) and one of several distinct latch commands ALE(0-5) for the column address are selectively directed to 
one and only one of the memory blocks. 

A first group of connection slots (13-16) are mounted on the motherboard and interconnected with the memory 
control unit in order to receive, the row and column addresses, the write commands, the latch commands and the 
timed signals to validate row and column addresses. One or more memory components of the same type are inserted 
in the slots. The type is either memory modules (1-4) of commercial DIMM type or supports (62) for memory 
modules of commercial DIMM type. The support includes a printed circuit board (37) with edge connector (38) 
insertion into a slot of the first group. A second group of connection slots (39-44) for memory modules of DIMM 
type. At least one memory module of DIMM type is plugged into a slot of the second group and several latch 
registers (45-50) each associated with a slot of the second group. Each register is controlled by one of the latch 
commands to latch the column address received through the edge connector and present it to address terminals of 
the associated slot. 



ADVANTAGE - Allows expansion of memory using non-interleaved DIMM modules connected directly to slots in 
motherboard. 

Title Terms /Index Terms/Additional Words: EXPAND; DYNAMIC; RANDOM; ACCESS; MEMORY; 
CONTROL; UNIT; GENERATE; ROW; COLUMN; ADDRESS; FUNCTION; CONFIGURATION; READ; 
WRITING; COMMAND 
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Alerting Abstract ...A row address (RADDR) and a column address (COL.ADDR) are placed at successive time 
intervals on an address output channel (17). A pair of several distinct timed signals (RAS.XY, CAS.XY. 
X=A,B,C,D,. Y=0,l-5) validate row and column addresses respectively, selectively directed to one and only one of 
the modules. One of several distinct timed write commands WE(0-5) and one of several distinct latch commands 

ALE(0-5) for the column address are selectively directed to one and only one of the memory blocks are mounted 

on the motherboard and interconnected with the memory control unit in order to receive, the row and column 
addresses, the write commands, the latch commands and the timed signals to validate row and column addresses. 

One or more memory components of the same type are inserted in the slots connection slots (39-44) for memory 

modules of DIMM type. At least one memory module of DIMM type is plugged into a slot of the second group and 
several latch registers (45-50) each associated with a slot of the second group. Each register is controlled by one 
of the latch commands to latch the column address received through the edge connector and present it to address 
terminals of the associated slot... Original Publication Data by Authority... Original Abstracts: random access 
memory (DRAM) with variable configuration depending on the number and capacity of standard memory modules, 
of DIMM type (1) plugged into a first plurality of slots (13, 14, 15, 16) of a memory motherboard (61) comprising a 
control unit (5), into which it is possible to plug, into the first plurality of slots (13, 14, 15, 16), in substitution for 
the memory modules (1), expansion supports (62) (Memory Riser), in turn provided with - a second plurality of 
slots (39, 40, 41, 42, 43, 44) for the insertion of standard memory modules of DIMM type (1), - and of column 
address latch registers (45, 50) each associated with a slot of the second plurality and thereby to support and 
allow the configurability and operability of interleaved-block memory, and access cycles, with partial time 
overlap, without renouncing the use of commercially available DIMM memory modules and without burdening the 

basic memory configuration with all the overheads required to support the Dynamic random access memory 

with variable configuration depending on the number and capacity of standard memory modules, of DIMM type 
plugged into a first plurality of slots of a memory motherboard comprising a control unit, into which it is possible 
to plug, into the first plurality of slots, in substitution for the memory modules, expansion supports, in turn 
provided with a second plurality of slots for the insertion of standard memory modules of DIMM type, and of 
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US Classification, Issued: 395375000, 395600000, 364DIG001, 364222900, 364271600, 364271800, 364231800 , 
364962300 

File Segment: EPI; 
DWPI Class: TO 1 

Manual Codes (EPI/S-X): T01-F03 

Digital data processor executing conditional instruction within single machine cycle - has two temporary 

registers for holding two pieces of data read from two general registers specified by instruction, to provide 
architecture for pipeline processing Original Titles: Digital data processor executing a conditional instruction 
within a single machine cycle Alerting Abstract ...The digital data processor includes a CPU, and a memory from 
which a condition instruction is fetched and passed to an instruction register. A first control circuit responds to 
source-codes to select the general registers having respective pieces of data to be compared. A first latch stores 
instruction data including a code for an operation to be performed by an arithmetic and logic unit (ALU) on the two 

pieces of data A conditional code register stores a conditional code representing the result of the ALU 

operation, and a second latch stores selection criteria for destination registers specified by the instruction. A 
selection circuit operates under the control of a third control circuit to sort the ALU output data of a third control 
circuit to the specific destination register in accordance with the selection criteria and the condition code... 
...ADVANTAGE - Allows pipeline processing to be performed without disturbance or processing time penalty, 
which ensures high-speed performance. Instruction execution is completed within a single CPU cycle. Title Terms 
.../Index Terms/Additional Words: REGISTER; Original Publication Data by AuthorityOriginal Abstracts: In a 
digital data processor having a CPU, a condition instruction is fetched from memory to an instruction register. A 
first control circuit responds to source-codes to select the general registers having respective pieces of data to be 
compared. A first latch stores instruction data including a code for an operation to be performed by an arithmetic 
and logic unit (ALU) on the two pieces of data under the control of a second control circuit. A conditional code 
register stores a conditional code, representing the result of the ALU operation, and a second latch stores selection 
criteria for destination registers specified by the instruction. A selection circuit operates under the control of a 
third control circuit to sort the ALU output data of a third control circuit to the specific destination register in 
accordance with the selection criteria and the condition code. The instruction execution is completed within a single 
CPU cycle. ...Claims:processor having a central processing unit operating in response to a stored set of instruction, 
a basic instruction of said instructions being executed within a single machine cycle, said digital processor 
comprising: an instruction register for reading an instruction; a plurality of general registers for storing data; 
means for reading respective pieces of data from two of said plurality of general registers, said two general 
registers being specified by instruction; operation means for applying an operation specified by the instruction to 
said pieces of data supplied from said general registers; means for generating a conditional code from the operated 

data produced by said operation means; means for selecting one of said respective pieces of data from said of 

data is larger or smaller than the other of said respective pieces of data as specified by the instruction, and for 
selecting a general register being specified by the instruction as a destination for the selected piece of data; and 
means for controlling said general registers, said operation means, said condition code generating means, said 
reading means, and said selecting means so that the instruction is executed within a single machine cycle. 



column address latch registers each associated with a slot of the second plurality and thereby to support and 
allow the configurability and operability of interleaved-block memory, and access cycles, with partial time overlap, 
without renouncing the use of commercially available DIMM memory modules and without burdening the basic 

memory configuration with all the overheads required to support the interleaved Claims:a row address 

(RADDR) and a column address (COL.ADDR) placed at successive time intervals on an address output channel 
(17), a pair of a plurality of distinct timed signals (RAS.XY, CAS.XY; X=A,B,C,D,; Y=0,1 ...5) validating row and 
column addresses respectively, selectively directed to one and only one of said modules, one of a plurality of 
distinct timed write commands WE(0...5) and one of a plurality of distinct latch commands ALE(0...5) for said 
column address, selectively directed to one and only one of the said memory blocks, - a first plurality of connection 
slots (13, 14, 15, 16) mounted on said motherboard and interconnected with said memory control unit (5) in order to 
receive, from said memory control unit, said row and column addresses, said write commands WE(0...5), said latch 
commands ALE(0...5) and said plurality of timed signals for validating row and column addresses and, inserted in 
said slots - one or more memory components of the same type, chosen from the following two types: - memory 
modules (1, 2, 3, 4) of commercial DIMM type - supports (62) for memory modules of commercial DIMM type 
comprising a printed circuit board (37) with edge connector (38) for insertion into a slot (13, 14, 15, 16) of said first 
plurality, a second plurality of connection slots (39... 44) for memory modules of DIMM type, at least one memory 
module of DIMM type plugged into a slot of the said second plurality and a plurality of latch registers (45.. .50) 
each associated with a slot of said second plurality, each controlled by one of said latch commands ALE(0...5) for 
latching said column address received through said edge connector (38) and for presenting it to address terminals 

of the associated slot via a system bus, a row address and a column address placed at successive time intervals 

on an address output channel, a pair of a plurality of distinct timed signals validating row and column addresses 
respectively, selectively directed to one and only one of said modules, one of a plurality of distinct timed write 
commands and one of a plurality of distinct latch commands for said column address, selectively directed to one 
and only one of the said memory blocks, a first plurality of connection slots mounted on said motherboard and 
interconnected with said memory control unit in order to receive, from said memory control unit, said row and 
column addresses, said write commands, said latch commands and said plurality of timed signals for validating row 
and column addresses and, inserted in said slots, one or more memory components of the same type, chosen from 
the following two types: memory modules of commercial DIMM type, and expansion supports for memory modules 
of commercial DIMM type, each expansion slot comprising a printed circuit board with an edge connector for 
insertion into a slot of said first plurality, a second plurality of connection slots for memory modules of DIMM 
type, at least one memory module of DIMM type plugged into a slot of the said second plurality and a plurality of 
latch registers each associated with a slot of said second plurality, each controlled by one of said latch commands 
for latching said column address received through said edge connector and for presenting it to address terminals 
of the associated slot. 
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Delay circuitry variable while operating - stores discrete signal samples at intervals determined by clock 
signal in sequence in chain of memory elements 
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Alerting Abstract DE Al 

The delay circuit stores discrete signal samples at intervals determined by a clock signal (CL) in sequence in a chain 



of memory elements to provide a predefined delay time. Each memory element operates a driving device. 
A shift register arrangement is formed from a chain of time-base circuits (20). Each driving device includes one of 
the time-base circuits. All time-base circuits are switched by the clock signal. A control arrangement supplies a start 
pulse to the first time-base circuit at a first time, tl. The start pulse is passed through the shift register arrangement 
until a second time, t2. The time between tl and t2 is a selectable whole number of the period of the clock cycle. 
ADVANTAGE - Can be adapted, pref. switched, to provide multiple different delay times in simple way. 
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Alerting Abstract ...A shift register arrangement is formed from a chain of time-base circuits (20). Each driving 
device includes one of the time-base circuits. All time-base circuits are switched by the clock signal. A control 
arrangement supplies a start pulse to the first time-base circuit at a first time, tl. The start pulse is passed through the 
shift register arrangement until a second time, t2. The time between tl and t2 is a selectable whole number of the 

period of the clock cycle ADVANTAGE - Can be adapted, pref. switched, to provide multiple different delay 

times in simple way. Original Publication Data by Authority.. .Original Abstracts:output circuit (13) of the memory 
arrangement (11) following in the series, by in each case one actuating device (14, 15, 20), comprising a shift 
register arrangement formed from a chain of bistable flipflop circuits (20),in which arrangement the output (25) 
of in each case one of the flipflop circuits (20) is connected to the input of the following flipflop circuit (20) in the 
chain, each actuating device (14, 15, 20) comprises in each case one of the flipflop circuits (20) and all flipflop 
circuits (20) are switched by the clock signal (CL), and comprising a command arrangement (35) which, at a first 
time (tl) supplies a (first) start pulse (ST) to the first flipflop circuit (20) in the shift register arrangement and 
enables the shift register arrangement for forwarding the start pulse (ST) through the chain of flipflop circuits 
(20) as determined by the clock signal (CL), and which interrupts the forwarding of the (first) start pulse (ST) at a 
second time (tlO) and, at the same time,supplies a next start pulse (ST) to the first flipflop circuit (20) in the shift 
register arrangement and again enables the shift register arrangement for forwarding this next start pulse (ST), 
the time interval between the first (tl) and the second (tlO) time being a selectable integral multiple of periods of 
the clock signal (CL) and the number (n) of the memory arrangements (1 1) or respectively of the flipflop circuits 
(20) corresponding at least to this multiple (n) of the periods of the clock signal (CL). In operation, this circuit 
arrangement can be adapted in a simple manner to a multiplicity of different values for the desired delay time 
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samples in a row of storage devices at time intervals which are determined by a clock signal and is read therefrom 
after expiration of a selectable delay time. Each storage device is connectable, via a respective input circuit to a 

useful signal input and, via a respective output circuit, to a useful signal output. The device being activatable, 

together with the output circuit of the next storage device in the row, by a respective activation device, which 
includes a shift register device formed by a chain of bistable trigger circuits in which the output of each of the 
trigger circuits is connected to the input of the next trigger circuit in the chain, each activation device including one 
of the trigger circuits and all trigger circuits being switched by the clock signal, and also comprising a command 
device which applies a (first) start pulse to the first trigger circuit in the shift register device at a first instant and 
which enables the shift register device to propagate the start pulse through the chain of trigger circuits in 
conformity with the clock signal and which interrupts the propagation of the (first) start pulse at a second instant and 
at the same time applies a next start pulse to the first trigger circuit in the shift register device and enables the shift 
register device again to propagate said next start pulse, the time interval between the first instance (tl) and the 
second instant (tlO) amounting to a selectable, integer multiple of periods of the clock signal, the number (n) of 
storage devices or trigger circuits corresponding at least to said multiple (n) of the periods of the clock signal. This 
circuit arrangement can be simply adapted to a plurality of different desired delay time values, during operation. 
...Claims:tlO) die Weiterleitung des (ersten) Startimpulses (ST) unterbricht und zugleich der ersten Kippschaltung 
(20) in der Schieberegisteranordnung einen nachsten Startimpuls (ST) zuleitet und die Schiebe registeranordnung 
fUr die Weiterleitung dieses nachsten Startimpulses (ST) erneut freigibt, wobei die Zeitspanne zwischen dem ersten 

(tl) und dem zweiten (tlO) Zeitpunkt ein wahlbares, ganzzahliges Vielfaches Taktsignals (CL) betragt und die 

Anzahl (n) der Speicheranordnungen (11) bzw. der Kippschaltungen (20) mindestens diesem Vielfachen (h) der 
Perioden des Taktsignals (CL) entspricht. 1... signal samples in a row of storage devices (1 1) at time intervals 
determined by a clock signal (CL) and is read therefrom after expiration of a selectable delay time, each storage 
device (1 1) being connectable, via a respective input circuit (12), to a useful signal input (30) and, via a respective... 
... device (11) being activatable, together with the output circuit (13) of the next storage device (1 1) in the row, by a 
respective activation device (14, 15, 20), comprising a shift register device formed by a chain of bistable trigger 

circuits (20) in which the output (25) of each of the trigger circuits (20) is connected next trigger circuit (20) in 

the chain, each activation device (14, 15,20) comprising one of the trigger circuits (20) and all trigger circuits (20) 
being switched by the clock signal (CL), and also comprising a command device (35) which applies a first start 
pulse (ST) to the first trigger circuit (20) in the shift register device at a first instant (tl), enables the shift register 
device to propagate the start pulse (ST) through the chain of trigger circuits (20) in conformity with the clock signal 
(CL), and interrupts the propagation of the first start pulse (ST) at a second instant (tlO) and at the same time applies 
a next start pulse (ST) to the first trigger circuit (20) in the shift register device and enables the shift register device 
again to propagate said next start pulse (ST), the time interval between the first instant (tl) and the second instant 
(tlO) amounting to a selectable, integer multiple of periods of the clock signal (CL),the number (n) of storage 
devices (1 1) or trigger circuits (20) corresponding at least to said multiple (n) of the periods of the clock signal 
(CL). I claim: A device for delaying a functional signal by a selectable number of discrete clock pulse periods, said 

device comprising:input means for receiving said functional signal:a series of storage devices each having a a 

delayed said functional signal, said output means being switchably coupled to said storage elements, and said 
delayed functional signal being received by said output means parallelly from said storage elements;a shift register 
having a series of stages each stage being coupled to a single one of said storage devices;clock means coupled to the 
shift register for recurrently providing shift pulses to said shift register; andload means coupled to the shift register 
for providing at selectable recurrence intervals a control signal to said shift register which control signal is serially 
shifted through said series of stages and wherein said control signal, when shifted into a particular stage, controls 
loading of the storage elements, which storage element is within the storage device coupled to the particular stage; 
with the sample of the functional signal and controls unloading of a delayed sample of the functional signal which 
was stored in a storage element of a next storage device in said series of storage devices. 
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Abstract: This paper presents a new method for improving the timing behaviour of digital circuits, which 
contain enable-registers and, e.g., come from the high level synthesis. Known techniques optimize all long 
combinational paths assuming only one clock cycle between registers. But enable-registers cause also paths 
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the quotient register, and at the end of the second cycle, this bit is left shifted one position and the second quotient 
bit is latched in the last position of the register, and so on. 

Identifiers: MICROPROCESSOR CHIP; LSSD (LEVEL SCAN SENSITIVE DEVICE) LATCHES; QUOTIENT 
REGISTER; LATCH CHAIN 



